Video Self-Supervised Learning
Uncovering the Hidden Dynamics of Video Self-supervised Learning under Distribution Shifts
Video self-supervised learning (VSSL) has made significant progress in recent years. However, the exact behavior and dynamics of these models under different forms of distribution shift are not yet known. In this paper, we comprehensively study the behavior of six popular self-supervised methods (v-SimCLR, v-MoCo, v-BYOL, v-SimSiam, v-DINO, v-MAE) in response to various forms of natural distribution shift, i.e., (i) context shift, (ii) viewpoint shift, (iii) actor shift, (iv) source shift, (v) generalizability to unknown classes (zero-shot), and (vi) open-set recognition. To perform this extensive study, we carefully craft a test bed consisting of 17 in-distribution and out-of-distribution benchmark pairs using available public datasets and a series of evaluation protocols to stress-test the different methods under the intended shifts. For instance, we observe that while video models generally struggle with context shifts, v-MAE and supervised learning exhibit more robustness.
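To make the evaluation idea concrete, below is a minimal sketch of one common way such an in-distribution/out-of-distribution benchmark pair can be stress-tested: a linear probe fit on frozen backbone features from the in-distribution training split, then scored on matched ID and OOD test splits. The probe choice (scikit-learn logistic regression), the function name, and the precomputed feature arrays are illustrative assumptions, not the paper's actual protocol.

```python
# Hedged sketch of an ID/OOD stress test with a frozen backbone.
# Feature matrices (N, D) and integer labels are assumed precomputed.
import numpy as np
from sklearn.linear_model import LogisticRegression

def probe_shift_gap(train_x, train_y, id_x, id_y, ood_x, ood_y):
    """Fit a linear probe on in-distribution training features, then
    compare accuracy on matched ID and OOD test splits that share the
    same label space. The ID-OOD gap quantifies robustness to the shift."""
    probe = LogisticRegression(max_iter=1000).fit(train_x, train_y)
    id_acc = probe.score(id_x, id_y)
    ood_acc = probe.score(ood_x, ood_y)
    return id_acc, ood_acc, id_acc - ood_acc

# Call-shape example with random placeholder features:
rng = np.random.default_rng(0)
x = rng.normal(size=(200, 64))
y = rng.integers(0, 5, size=200)
print(probe_shift_gap(x[:100], y[:100], x[100:150], y[100:150],
                      x[150:], y[150:]))
```

A lower OOD accuracy relative to ID accuracy under, say, a viewpoint or context shift would indicate that the learned representation is sensitive to that factor rather than to the action itself.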
Language-based Action Concept Spaces Improve Video Self-Supervised Learning
Recent contrastive language-image pre-training has led to learning highly transferable and robust image representations. However, adapting these models to the video domain with minimal supervision remains an open problem. We explore a simple step in that direction, using language-tied self-supervised learning to adapt an image CLIP model to the video domain. A backbone modified for temporal modeling is trained in a self-distillation setting with training objectives operating in an action concept space. This space is constructed from feature vectors of various action concepts, extracted from a language encoder using relevant textual prompts.
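As a rough illustration of how such a concept space might be built (a hedged sketch, not the paper's implementation: the OpenAI `clip` package, the "ViT-B/32" weights, the prompt template, and the action list are all assumptions for illustration), one can embed a textual prompt per action class with a frozen language encoder and use the normalized text features as concept anchors:

```python
# Illustrative sketch: an "action concept space" from a frozen CLIP
# text encoder. Prompt template and action names are placeholders.
import torch
import clip

device = "cuda" if torch.cuda.is_available() else "cpu"
model, _ = clip.load("ViT-B/32", device=device)

actions = ["archery", "playing guitar", "high jump"]  # placeholder classes
prompts = [f"a video of a person {a}" for a in actions]

with torch.no_grad():
    tokens = clip.tokenize(prompts).to(device)
    concepts = model.encode_text(tokens)                       # (K, D) anchors
    concepts = concepts / concepts.norm(dim=-1, keepdim=True)  # unit-normalize

def to_concept_space(video_features: torch.Tensor) -> torch.Tensor:
    """Map video features to per-concept cosine similarities; training
    objectives (e.g., self-distillation targets) can operate on these
    (N, K) scores instead of raw visual features."""
    video_features = video_features / video_features.norm(dim=-1, keepdim=True)
    return video_features @ concepts.T
```

Operating in this language-anchored space rather than on raw visual features is what ties the video backbone's self-supervised objectives to the semantics of the original CLIP text encoder.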